AITopics | Niger State

Collaborating Authors

Niger State

NaijaNLP: A Survey of Nigerian Low-Resource Languages

arXiv.org Artificial IntelligenceMar-6-2025

With over 500 languages in Nigeria, three languages -- Hausa, Yor\`ub\'a and Igbo -- spoken by over 175 million people, account for about 60% of the spoken languages. However, these languages are categorised as low-resource due to insufficient resources to support tasks in computational linguistics. Several research efforts and initiatives have been presented, however, a coherent understanding of the state of Natural Language Processing (NLP) - from grammatical formalisation to linguistic resources that support complex tasks such as language understanding and generation is lacking. This study presents the first comprehensive review of advancements in low-resource NLP (LR-NLP) research across the three major Nigerian languages (NaijaNLP). We quantitatively assess the available linguistic resources and identify key challenges. Although a growing body of literature addresses various NLP downstream tasks in Hausa, Igbo, and Yor\`ub\'a, only about 25.1% of the reviewed studies contribute new linguistic resources. This finding highlights a persistent reliance on repurposing existing data rather than generating novel, high-quality resources. Additionally, language-specific challenges, such as the accurate representation of diacritics, remain under-explored. To advance NaijaNLP and LR-NLP more broadly, we emphasise the need for intensified efforts in resource enrichment, comprehensive annotation, and the development of open collaborative initiatives.

arxiv preprint arxiv, dataset, naijanlp, (14 more...)

arXiv.org Artificial Intelligence

2502.19784

Country:

Africa > Niger (0.14)
Africa > Cameroon (0.14)
Africa > Nigeria > Jigawa State > Dutse (0.05)
(29 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Health & Medicine (1.00)
Information Technology > Security & Privacy (0.46)
Media > News (0.46)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
(3 more...)

Add feedback

Semantic Component Analysis: Discovering Patterns in Short Texts Beyond Topics

Eichin, Florian, Schuster, Carolin M., Groh, Georg, Hedderich, Michael A.

arXiv.org Artificial IntelligenceDec-16-2024

Topic modeling is a key method in text analysis, but existing approaches are limited by assuming one topic per document or fail to scale efficiently for large, noisy datasets of short texts. We introduce Semantic Component Analysis (SCA), a novel topic modeling technique that overcomes these limitations by discovering multiple, nuanced semantic components beyond a single topic in short texts which we accomplish by introducing a decomposition step to the clustering-based topic modeling framework. We evaluate SCA on Twitter datasets in English, Hausa and Chinese. It achieves competetive coherence and diversity compared to BERTopic, while uncovering at least double the semantic components and maintaining a noise rate close to zero. Furthermore, SCA is scalable and effective across languages, including an underrepresented one.

bertopic, dataset, semantic component, (13 more...)

arXiv.org Artificial Intelligence

2410.21054

Country:

Asia > Russia (0.28)
Asia > China (0.04)
North America > Canada (0.04)
(21 more...)

Genre: Research Report (1.00)

Industry:

Media > News (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Leisure & Entertainment > Sports (0.93)
(3 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Add feedback

Bridging Relevance and Reasoning: Rationale Distillation in Retrieval-Augmented Generation

Jia, Pengyue, Xu, Derong, Li, Xiaopeng, Du, Zhaocheng, Li, Xiangyang, Zhao, Xiangyu, Wang, Yichao, Wang, Yuhao, Guo, Huifeng, Tang, Ruiming

arXiv.org Artificial IntelligenceDec-11-2024

The reranker and generator are two critical components in the Retrieval-Augmented Generation (i.e., RAG) pipeline, responsible for ranking relevant documents and generating responses. However, due to differences in pre-training data and objectives, there is an inevitable gap between the documents ranked as relevant by the reranker and those required by the generator to support answering the query. To address this gap, we propose RADIO, a novel and practical preference alignment framework with RAtionale DIstillatiOn. Specifically, We first propose a rationale extraction method that leverages the reasoning capabilities of Large Language Models (LLMs) to extract the rationales necessary for answering the query. Subsequently, a rationale-based alignment process is designed to rerank the documents based on the extracted rationales, and fine-tune the reranker to align the preferences. We conduct extensive experiments on two tasks across three datasets to demonstrate the effectiveness of our approach compared to baseline methods. Our code is released online to ease reproduction.

large language model, machine learning, reranker, (20 more...)

arXiv.org Artificial Intelligence

2412.08519

Country:

Africa > Nigeria > Niger State (0.05)
Africa > Nigeria > Lagos State (0.05)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
(4 more...)

Genre: Research Report (0.82)

Industry:

Leisure & Entertainment (1.00)
Media > Music (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Anomaly Detection in California Electricity Price Forecasting: Enhancing Accuracy and Reliability Using Principal Component Analysis

Nyangon, Joseph, Akintunde, Ruth

arXiv.org Artificial IntelligenceNov-25-2024

Accurate and reliable electricity price forecasting has significant practical implications for grid management, renewable energy integration, power system planning, and price volatility management. This study focuses on enhancing electricity price forecasting in California's grid, addressing challenges from complex generation data and heteroskedasticity. Utilizing principal component analysis (PCA), we analyze CAISO's hourly electricity prices and demand from 2016-2021 to improve day-ahead forecasting accuracy. Initially, we apply traditional outlier analysis with the interquartile range method, followed by robust PCA (RPCA) for more effective outlier elimination. This approach improves data symmetry and reduces skewness. We then construct multiple linear regression models using both raw and PCA-transformed features. The model with transformed features, refined through traditional and SAS Sparse Matrix outlier removal methods, shows superior forecasting performance. The SAS Sparse Matrix method, in particular, significantly enhances model accuracy. Our findings demonstrate that PCA-based methods are key in advancing electricity price forecasting, supporting renewable integration and grid management in day-ahead markets. Keywords: Electricity price forecasting, principal component analysis (PCA), power system planning, heteroskedasticity, renewable energy integration.

data mining, forecasting, machine learning, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1002/wene.504

2412.07787

Country:

North America > United States > North Carolina > Wake County > Cary (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > District of Columbia > Washington (0.04)
(5 more...)

Genre: Research Report > New Finding (0.86)

Industry:

Energy > Renewable > Solar (1.00)
Energy > Power Industry > Utilities (0.93)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.82)

Add feedback

'No' Matters: Out-of-Distribution Detection in Multimodality Long Dialogue

Gao, Rena, Wu, Xuetong, Luo, Siwen, Han, Caren, Liu, Feng

arXiv.org Artificial IntelligenceOct-31-2024

Out-of-distribution (OOD) detection in multimodal contexts is essential for identifying deviations in combined inputs from different modalities, particularly in applications like open-domain dialogue systems or real-life dialogue interactions. This paper aims to improve the user experience that involves multi-round long dialogues by efficiently detecting OOD dialogues and images. We introduce a novel scoring framework named Dialogue Image Aligning and Enhancing Framework (DIAEF) that integrates the visual language models with the novel proposed scores that detect OOD in two key scenarios (1) mismatches between the dialogue and image input pair and (2) input pairs with previously unseen labels. Our experimental results, derived from various benchmarks, demonstrate that integrating image and multi-round dialogue OOD detection is more effective with previously unseen labels than using either modality independently. In the presence of mismatched pairs, our proposed score effectively identifies these mismatches and demonstrates strong robustness in long dialogues. This approach enhances domain-aware, adaptive conversational agents and establishes baselines for future studies.

detection, dialogue, ood detection, (14 more...)

arXiv.org Artificial Intelligence

2410.23883

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Oceania > New Zealand > South Island > Canterbury Region > Christchurch (0.04)
Oceania > Australia > Western Australia (0.04)
(5 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(4 more...)

Add feedback

Improving the accuracy of food security predictions by integrating conflict data

Bertetti, Marco, Agnolucci, Paolo, Calzadilla, Alvaro, Capra, Licia

arXiv.org Artificial IntelligenceOct-12-2024

Food security (FS) is a complex and multifaceted problem, influenced by several factors such as weather events, economic shocks, and natural disasters. Understanding the dynamics of food security is crucial for effective policymaking and humanitarian efforts. While conflicts and violent events increasingly stand out as key drivers of food crises[1], the depth of their impact remains largely underexplored. Examining the quantitative aspects of this impact is essential for developing more targeted interventions and strategies to address the complex interplay between conflict and food security. Existing research tends to be qualitative in nature (Kemmerling et al.2022; Brown et al. 2020; Brown et al. 2021), leaving a significant gap in understanding the quantitative aspects of how conflicts impact FS levels. By delving into quantitative analyses, we can not only enhance our comprehension of the magnitude of the problem but also pave the way for evidence-based decision-making in efforts to alleviate food insecurity in conflict-affected regions. Regarding the qualitative study of conflicts and FS, Kemmerling et al.(2022)[2] provided a comprehensive explanation on how violence and armed conflicts impact FS through destruction, displacement, financing of conflicts and food being used as a weapon. The authors call for better conflict data collection, and an increase in focus on the study of conflicts early warnings.

artificial intelligence, conflict, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2410.22342

Country:

North America > United States (0.93)
Africa > Ethiopia (0.29)
Asia > Russia (0.28)
(12 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Food & Agriculture > Agriculture (1.00)
Government > Regional Government > North America Government > United States Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

Multilingual Transfer and Domain Adaptation for Low-Resource Languages of Spain

Luo, Yuanchang, Wu, Zhanglin, Wei, Daimeng, Shang, Hengchao, Li, Zongyao, Guo, Jiaxin, Rao, Zhiqiang, Li, Shaojun, Yang, Jinlong, Xie, Yuhao, Wei, Jiawei Zheng Bin, Yang, Hao

arXiv.org Artificial IntelligenceSep-29-2024

This article introduces the submission status of the Translation into Low-Resource Languages of Spain task at (WMT 2024) by Huawei Translation Service Center (HW-TSC). We participated in three translation tasks: spanish to aragonese (es-arg), spanish to aranese (es-arn), and spanish to asturian (es-ast). For these three translation tasks, we use training strategies such as multilingual transfer, regularized dropout, forward translation and back translation, labse denoising, transduction ensemble learning and other strategies to neural machine translation (NMT) model based on training deep transformer-big architecture. By using these enhancement strategies, our submission achieved a competitive result in the final evaluation.

machine translation, translation, translation task, (15 more...)

arXiv.org Artificial Intelligence

2409.15924

Country:

Europe > Spain (0.62)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
Asia > China > Beijing > Beijing (0.04)
Africa > Nigeria > Niger State > Minna (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Machine Translation Advancements of Low-Resource Indian Languages by Transfer Learning

Wei, Bin, Zhen, Jiawei, Li, Zongyao, Wu, Zhanglin, Wei, Daimeng, Guo, Jiaxin, Rao, Zhiqiang, Li, Shaojun, Luo, Yuanchang, Shang, Hengchao, Yang, Jinlong, Xie, Yuhao, Yang, Hao

arXiv.org Artificial IntelligenceSep-24-2024

This paper introduces the submission by Huawei Translation Center (HW-TSC) to the WMT24 Indian Languages Machine Translation (MT) Shared Task. To develop a reliable machine translation system for low-resource Indian languages, we employed two distinct knowledge transfer strategies, taking into account the characteristics of the language scripts and the support available from existing open-source models for Indian languages. For Assamese(as) and Manipuri(mn), we fine-tuned the existing IndicTrans2 open-source model to enable bidirectional translation between English and these languages. For Khasi (kh) and Mizo (mz), We trained a multilingual model as a baseline using bilingual data from these four language pairs, along with an additional about 8kw English-Bengali bilingual data, all of which share certain linguistic features. This was followed by fine-tuning to achieve bidirectional translation between English and Khasi, as well as English and Mizo. Our transfer learning experiments produced impressive results: 23.5 BLEU for en-as, 31.8 BLEU for en-mn, 36.2 BLEU for as-en, and 47.9 BLEU for mn-en on their respective test sets. Similarly, the multilingual model transfer learning experiments yielded impressive outcomes, achieving 19.7 BLEU for en-kh, 32.8 BLEU for en-mz, 16.1 BLEU for kh-en, and 33.9 BLEU for mz-en on their respective test sets. These results not only highlight the effectiveness of transfer learning techniques for low-resource languages but also contribute to advancing machine translation capabilities for low-resource Indian languages.

machine translation, proceedings, translation, (14 more...)

arXiv.org Artificial Intelligence

2409.15879

Country:

North America > Canada > Ontario > Toronto (0.04)
Europe > Czechia > Prague (0.04)
Asia > China > Beijing > Beijing (0.04)
Africa > Nigeria > Niger State > Minna (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Choose the Final Translation from NMT and LLM hypotheses Using MBR Decoding: HW-TSC's Submission to the WMT24 General MT Shared Task

Wu, Zhanglin, Wei, Daimeng, Li, Zongyao, Shang, Hengchao, Guo, Jiaxin, Li, Shaojun, Rao, Zhiqiang, Luo, Yuanchang, Xie, Ning, Yang, Hao

arXiv.org Artificial IntelligenceSep-23-2024

This paper presents the submission of Huawei Translate Services Center (HW-TSC) to the WMT24 general machine translation (MT) shared task, where we participate in the English to Chinese (en2zh) language pair. Similar to previous years' work, we use training strategies such as regularized dropout, bidirectional training, data diversification, forward translation, back translation, alternated training, curriculum learning, and transductive ensemble learning to train the neural machine translation (NMT) model based on the deep Transformer-big architecture. The difference is that we also use continue pre-training, supervised fine-tuning, and contrastive preference optimization to train the large language model (LLM) based MT model. By using Minimum Bayesian risk (MBR) decoding to select the final translation from multiple hypotheses for NMT and LLM-based MT models, our submission receives competitive results in the final evaluation.

computational linguistic, machine translation, translation, (11 more...)

arXiv.org Artificial Intelligence

2409.148

Country:

North America > Canada > Ontario > Toronto (0.04)
Europe > Czechia > Prague (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
(2 more...)

Genre:

Research Report (0.50)
Instructional Material > Course Syllabus & Notes (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Digital Twinning of a Pressurized Water Reactor Startup Operation and Partial Computational Offloading in In-network Computing-Assisted Multiaccess Edge Computing

Aliyu, Ibrahim, Arigi, Awwal M., Um, Tai-Won, Kim, Jinsul

arXiv.org Artificial IntelligenceJun-24-2024

This paper addresses the challenge of representing complex human action (HA) in a nuclear power plant (NPP) digital twin (DT) and minimizing latency in partial computation offloading (PCO) in sixth-generation-enabled computing in the network (COIN) assisted multiaccess edge computing (MEC). Accurate HA representation in the DT-HA model is vital for modeling human interventions that are crucial for the safe and efficient operation of NPPs. In this context, DT-enabled COIN-assisted MEC harnesses DT (known as a cybertwin) capabilities to optimize resource allocation and reduce latency effectively. A two-stage approach is employed to address system complexity. First, a probabilistic graphical model (PGM) is introduced to capture HAs in the DT abstraction. In the PGM, HA and NPP asset-twin abstractions form coupled systems that evolve and interact through observable data and control input. Next, the underlying PCO problem is formulated as a multiuser game, where NPP assets can partially offload tasks to COIN and MEC. We propose a decentralized algorithm to optimize offloading decisions, offloading ratios, and resource allocation. The simulation results demonstrate the effectiveness of the proposed method in capturing complex HAs and optimal resource allocation in DT-enabled NPPs.

opération, probability, subsystem, (16 more...)

arXiv.org Artificial Intelligence

2407.12011

Country:

Asia > South Korea > Daejeon > Daejeon (0.04)
Europe > Norway (0.04)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
(6 more...)

Genre: Research Report (0.84)

Industry:

Information Technology > Security & Privacy (1.00)
Energy > Power Industry > Utilities > Nuclear (1.00)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)

Add feedback